1.  INTRODUCTION

Recently, optimal estimation and detection of signal processes
generated by bilinear dynamical systems has been the subject of
investigation in a number of articles, e.g. [1], [2] and [3]. Problems of
this type arise in such practical applications as inertial navigation,
satellite attitude control and angle modulation.

We consider least-squares filtering of stochastic processes
generated by the so-called nilpotent bilinear dynamical system driven by a
Gauss-Markov process observed in a white gaussian noise environment. In
view of the work by Fliess [4] and Sussman [5], there is strong reason to
suspect that the above class of systems can approximate a much wider
class of nonlinear systems. The existence of finite dimensional,
recursive filters for such processes was established in [3] under a
somewhat weaker Lie-algebraic nilpotency condition. In the present paper,
explicit stochastic differential equations realizing such filters are
derived by "closing" the infinite dimensional nonlinear filtering
equations of Kushner [6]. It is found that the filter structure is not only
bilinear but also inherits the nilpotency property from the original
system.

The next section presents the problem formulation. The third
section contains the derivation of filter equations. The final section
concludes with an example and computational considerations.

2.  PROBLEM  STATEMENT

We first delineate the class of nonlinear filtering problems
addressed in this paper.

We are given the standard linear Ito models for the signal process
$\{\xi(t)\}_{t \ge 0}$ and the observation process $\{z(t)\}_{t \ge 0}$,
respectively, as follows.

The signal model:

$$d\xi(t) = F(t)\,\xi(t)\,dt + Q^{1/2}(t)\,dw(t), \quad t \ge 0 \tag{2.1}$$

The observation model:

$$dz(t) = H(t)\,\xi(t)\,dt + R^{1/2}(t)\,dv(t), \quad t \ge 0 \tag{2.2}$$

where $w(\cdot)$ and $v(\cdot)$ are standard N and p dimensional independent
Wiener processes respectively, $\xi(t) \in \mathbb{R}^N$, $z(t) \in \mathbb{R}^p$,
$\xi(0)$ is a zero mean gaussian random vector independent of the $w(\cdot)$
and $v(\cdot)$ processes, and $F(\cdot)$, $Q^{1/2}(\cdot)$, $H(\cdot)$,
$R^{1/2}(\cdot)$ are time-dependent matrices of appropriate dimensions
with $Q(t)$, $R(t)$ positive definite and continuously differentiable for
all t.


Now consider a process $\{x(t)\}_{t \ge 0}$ generated by a bilinear
dynamical system of the following form:

$$dx(t) = \Big(A + \sum_{i=1}^{N} B_i\,\xi_i(t)\Big)x(t)\,dt, \quad x(t) \in \mathbb{R}^M, \quad t \ge 0 \tag{2.3}$$

where $x(0)$ is a gaussian random vector independent of $\xi(0)$,
$\{w(t)\}_{t \ge 0}$, $\{v(t)\}_{t \ge 0}$.


We are interested in an exact, finite dimensional system of
stochastic differential equations for the least squares estimator
$E[x(t) \mid z^t] \triangleq \hat{x}(t|t)$ of $x(t)$. It is well-known (see e.g. [6]) that
this, in general, is not possible. In this paper, we intend to derive
such a filter and study its characteristics with the proviso that the
matrices A and $\{B_i\}_{i=1}^{N}$ in (2.3) satisfy a well-known algebraic
property called nilpotency.

Definition 1:

An associative algebra $\mathcal{A}$ of matrices is said to be nilpotent
(of order K) if there exists a positive integer K such that the (matrix)
product of any K or more matrices in $\mathcal{A}$ vanishes.


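Assumption 2:

The associative algebra $\mathcal{A}$ generated by the set of matrices
$\{\mathrm{Ad}_A^k(B_i) \mid i = 1,2,\dots,N;\ k = 0,1,\dots\}$ is nilpotent
(of order K), where

$$\mathrm{Ad}_A^0(B_i) = B_i$$

and

$$\mathrm{Ad}_A^k(B_i) = [A,\ \mathrm{Ad}_A^{k-1}(B_i)] = A\,\mathrm{Ad}_A^{k-1}(B_i) - \mathrm{Ad}_A^{k-1}(B_i)\,A \quad \text{for } k = 1,2,\dots$$
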
Remark:

Assumption 2 is stronger than the Lie-algebraic nilpotency
assumption made in [3].
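
Assumption 2 can be checked numerically for a given A and set of B_i. The following is a minimal sketch and not part of the original development: the helper names `ad_generators` and `nilpotency_order`, the numerical values of A, B_1, B_2 and the tolerance are all illustrative assumptions. It relies on two facts. First, the algebra is spanned by products of generators, so it is nilpotent of order K exactly when every product of K generators vanishes. Second, by the Cayley-Hamilton theorem applied to the linear map X -> [A, X] on M x M matrices, the iterated commutators with k >= M^2 lie in the span of the lower ones, so only finitely many generators need to be formed.

```python
import numpy as np

def ad_generators(A, Bs):
    """Collect Ad_A^k(B_i) for k = 0, ..., M*M - 1, skipping zero matrices."""
    M = A.shape[0]
    gens = []
    for B in Bs:
        G = np.asarray(B, dtype=float)
        for _ in range(M * M):
            if not G.any():
                break  # all higher iterated commutators vanish as well
            gens.append(G)
            G = A @ G - G @ A  # Ad_A^{k+1}(B) = [A, Ad_A^k(B)]
    return gens

def nilpotency_order(gens, max_order=6, tol=1e-12):
    """Smallest K such that every product of K generators vanishes.

    Returns None if no such K <= max_order is found.
    """
    products = list(gens)  # nonzero products of length 1
    for K in range(1, max_order + 1):
        if not products:
            return K
        # Extend every surviving product by one more generator and prune zeros.
        products = [P @ G for P in products for G in gens]
        products = [P for P in products if np.abs(P).max() > tol]
    return None

# Illustrative (assumed) data: diagonal A, superdiagonal B_i, M = 3, N = 2.
A = np.diag([-1.0, -2.0, -3.0])
B1 = np.diag([1.0, 2.0], k=1)
B2 = np.diag([0.5, -1.0], k=1)
order = nilpotency_order(ad_generators(A, [B1, B2]))
print(order)  # prints 3
```

For this choice every generator is strictly upper triangular, so products of three generators vanish while some products of two do not. The search is exponential in the product length and is only meant for small examples.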


3.  THE  STRUCTURE  OF  THE  NONLINEAR  FILTER


The filtering problem posed in the last section was shown to be
amenable to solution in [3]. In the main theorem of this paper, which
follows shortly, we explicitly exhibit the structure of the nonlinear
filter and study its characteristics. We begin with some lemmas.


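Lemma 3:

The input-output map realized by the system (2.3) satisfying
Assumption 2 can also be realized by a bilinear system which is a
"direct sum" of (a finite number of) finite dimensional systems of the
following form

$$dx^k(t) = \Big(\bar{A}^k + \sum_{i=1}^{N} \bar{B}_i^k\,\xi_i(t)\Big)x^k(t)\,dt, \quad t \ge 0 \tag{3.1}$$

where

$$\bar{A}^k = \begin{bmatrix} A_1^k & & & 0 \\ & A_2^k & & \\ & & \ddots & \\ 0 & & & A_{M_k}^k \end{bmatrix} \tag{3.2}$$

with $A_l^k$, $l = 1,\dots,M_k$, being $J_l \times J_l$ Jordan blocks, and

$$\bar{B}_i^k = \begin{bmatrix} 0 & B_{i,1}^k & & 0 \\ & 0 & \ddots & \\ & & \ddots & B_{i,M_k-1}^k \\ 0 & & & 0 \end{bmatrix} \tag{3.3}$$

with $B_{i,j}^k$, $i = 1,\dots,N$; $j = 1,\dots,M_k - 1$, being $J_j \times J_{j+1}$ matrices.

Hence, the associative algebra generated by
$\{\mathrm{Ad}_{\bar{A}^k}^{l}(\bar{B}_i^k) \mid i = 1,\dots,N;\ l = 0,1,\dots\}$
is also nilpotent of the same order K.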

Proof:

The proof follows closely the ideas in the proofs of theorems 4
and 6 of Brockett [7] and hence is kept brief.

Using the transformation $z(t) = e^{-At}x(t)$ and applying the
Peano-Baker formula, we find that the Volterra series solution to (2.3) is
given by

$$x(t) = \Big[e^{At} + \int_0^t e^{A(t-\sigma_1)}\Big(\sum_{i=1}^{N} B_i\,\xi_i(\sigma_1)\Big)e^{A\sigma_1}\,d\sigma_1 + \int_0^t\!\!\int_0^{\sigma_1} e^{A(t-\sigma_1)}\Big(\sum_{i=1}^{N} B_i\,\xi_i(\sigma_1)\Big)e^{A(\sigma_1-\sigma_2)}\Big(\sum_{j=1}^{N} B_j\,\xi_j(\sigma_2)\Big)e^{A\sigma_2}\,d\sigma_2\,d\sigma_1 + \cdots\Big]x(0) \tag{3.4}$$

It now follows from the Baker-Campbell-Hausdorff formula together with
Assumption 2 that all the terms in (3.4) beyond the first K terms will
vanish.
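
The termination of the series can be observed numerically on a small example. The sketch below is illustrative only and rests on several assumptions that are not in the text: M = 3, N = 1, a diagonal A with made-up entries, a strictly upper triangular B, and a smooth deterministic function (cos) standing in for one sample path of the driving process. It builds the Peano-Baker terms for $z(t) = e^{-At}x(t)$ by trapezoidal quadrature, confirms that the third-order term is identically zero, and compares the series truncated after the second-order term with a Runge-Kutta solution of (2.3).

```python
import numpy as np

a = np.array([-0.5, -1.0, -1.5])   # diagonal of A (assumed values)
A = np.diag(a)
B = np.diag([1.0, 2.0], k=1)       # strictly upper triangular, N = 1
xi = np.cos                        # stand-in for a sample path of xi(t)
x0 = np.array([0.3, -0.2, 1.0])

def rk4(x_init, T, n):
    """Integrate dx/dt = (A + B xi(t)) x with the classical Runge-Kutta scheme."""
    f = lambda t, x: (A + B * xi(t)) @ x
    h, t, x = T / n, 0.0, x_init.copy()
    for _ in range(n):
        k1 = f(t, x)
        k2 = f(t + h / 2, x + h / 2 * k1)
        k3 = f(t + h / 2, x + h / 2 * k2)
        k4 = f(t + h, x + h * k3)
        x = x + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return x

def cumtrapz(F, h):
    """Cumulative trapezoidal integral along axis 0, starting from zero."""
    out = np.zeros_like(F)
    out[1:] = np.cumsum(0.5 * h * (F[1:] + F[:-1]), axis=0)
    return out

def series_terms(T, n):
    """First three Peano-Baker terms for dz/dt = N(t) z, N = xi e^{-At} B e^{At}."""
    ts = np.linspace(0.0, T, n + 1)
    h = T / n
    # A is diagonal, so (e^{-At} B e^{At})_{ij} = B_ij * exp(-(a_i - a_j) t).
    scale = np.exp(-(a[:, None] - a[None, :]) * ts[:, None, None])
    N = xi(ts)[:, None, None] * scale * B
    I1 = cumtrapz(N, h)        # first-order term
    I2 = cumtrapz(N @ I1, h)   # second-order term
    I3 = cumtrapz(N @ I2, h)   # third-order term
    return I1[-1], I2[-1], I3[-1]

T, n = 1.0, 2000
I1, I2, I3 = series_terms(T, n)
x_series = np.diag(np.exp(a * T)) @ (np.eye(3) + I1 + I2) @ x0
x_rk4 = rk4(x0, T, n)
print(np.abs(I3).max(), np.abs(x_series - x_rk4).max())
```

Here the algebra is nilpotent of order 3, so exactly three terms (orders 0, 1 and 2) survive, in line with the statement that terms beyond the first K vanish.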

Now, let the Jordan form of A be given by $\mathrm{diag}\{A_1, \dots, A_s\}$
where $A_i$ is, say, $J_i \times J_i$. Let $K_i \triangleq \sum_{l=1}^{i} J_l$, $i = 1,\dots,s$ with
$K_0 \triangleq 0$, and define $y^i \triangleq [x_{K_{i-1}+1}, \dots, x_{K_i}]^T$, $i = 1,\dots,s$. Recall that
$e^{At}$ consists of $J_i \times J_k$ blocks, $i,k = 1,\dots,s$, which can be written as
$\sum_{l=1}^{s} C_l\,e^{A_l t}\,D_l$, for some $J_i \times J_l$ matrices $C_l$ and $J_l \times J_k$ matrices $D_l$.

With these remarks in mind, it is not difficult to see that the r-th order
term, $1 \le r < K$, in the Volterra series (3.4), for the subset $y^i(\sigma_0)$ of
components of $x(\sigma_0)$, $i = 1,\dots,s$, consists of groups of terms of the form

$$\int_0^{\sigma_0}\!\!\int_0^{\sigma_1}\!\!\cdots\!\int_0^{\sigma_{r-1}} C_1\,e^{A_1(\sigma_0-\sigma_1)}\,D_1\,e^{A_2(\sigma_1-\sigma_2)}\,D_2 \cdots D_r\,e^{A_r\sigma_r}\,y^k(0)\ \prod_{i=1}^{r}\xi_{m_i}(\sigma_i)\,d\sigma_i .$$

But each group of this type is realizable by a system of the type claimed
in the lemma with $\bar{A}$ and $\{\bar{B}_j\}_{j=1}^{N}$ as in (3.2), (3.3) (see theorem 4 of
[7] for details). The final assertion regarding nilpotency is obvious if
we note that $\bar{A}$ is upper triangular while $\bar{B}_i$, $i = 1,\dots,N$ are strictly
upper triangular.

Lemma 4:

Consider the signal and observation models of (2.1) and (2.2)
respectively. Define a (vector) process $\{y(t)\}_{t \ge 0}$ by

$$dy(t) = D\,y(t)\,dt + \sum_{i=1}^{N} E_i\,\xi_i(t)\,y(t)\,dt \tag{3.5}$$

where D, $\{E_i\}_{i=1}^{N}$ are matrices of appropriate dimension, and
$y(0)$ is independent of $\xi(0)$ and of the $w(\cdot)$ and $v(\cdot)$ processes.

Then $\hat{y}(t|t) \triangleq E[y(t) \mid z^t]$ satisfies the following stochastic differential
equation:

$$d\hat{y}(t|t) = D\,\hat{y}(t|t)\,dt + \sum_{i=1}^{N} E_i\,E^t[\xi_i(t)\,y(t)]\,dt + \big\{E^t[y(t)\,\xi^T(t)] - \hat{y}(t|t)\,\hat{\xi}^T(t|t)\big\}\,H^T(t)R^{-1}(t)\,d\nu(t), \qquad \hat{y}(0|0) = E[y(0)] \tag{3.6}$$

where

$$E^t[\,\cdot\,] \triangleq E[\,\cdot \mid z^t] \triangleq E[\,\cdot \mid \{z(s) \mid 0 \le s \le t\}],$$

$$d\nu(t) = dz(t) - H(t)\,\hat{\xi}(t|t)\,dt .$$

Proof:

Apply the Kushner nonlinear filtering equations [6] to the signal
process $(y^T(\cdot), \xi^T(\cdot))^T$ with $z(\cdot)$ as the observation process.


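Lemma 5:

Let $x = (x_0, x_1, \dots, x_n)^T$ be a gaussian random vector with mean
vector $m = (m_0, m_1, \dots, m_n)^T$ and covariance matrix $P = [P_{ij}]_{i,j=0}^{n}$.
We then have the following relation:

$$E\Big[\prod_{i=0}^{n} x_i\Big] = \begin{cases} m_0\,E\Big[\displaystyle\prod_{i=1}^{n} x_i\Big] + \displaystyle\sum_{j=1}^{n} P_{0j}\,E\Big[\prod_{\substack{i=1 \\ i \ne j}}^{n} x_i\Big], & n > 1 \\[2ex] m_0 m_1 + P_{01}, & n = 1. \end{cases} \tag{3.7}$$

Proof:  See e.g. [8].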
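
The recursion of Lemma 5 translates directly into a short routine. The sketch below is an illustration, not part of the original text: the function name `product_moment` and the test values are assumptions. Factors are specified by an index list, so a repeated index gives a higher power of the same component, in which case $P_{0j}$ is simply the corresponding variance.

```python
def product_moment(idx, m, P):
    """E[prod_i x_{idx[i]}] for a gaussian vector with mean m and covariance P.

    Implements the recursion of Lemma 5: peel off the first factor, then sum
    over its pairings with each of the remaining factors.
    """
    if not idx:
        return 1.0                     # empty product
    if len(idx) == 1:
        return m[idx[0]]
    first, rest = idx[0], idx[1:]
    total = m[first] * product_moment(rest, m, P)
    for pos, j in enumerate(rest):
        remaining = rest[:pos] + rest[pos + 1:]   # drop the paired factor
        total += P[first][j] * product_moment(remaining, m, P)
    return total

mean = [1.0, 2.0]
cov = [[1.0, 0.5], [0.5, 2.0]]
second = product_moment([0, 1], mean, cov)             # m_0 m_1 + P_01
fourth = product_moment([0, 0, 0, 0], [0.0], [[1.0]])  # E[x^4] for N(0, 1)
third = product_moment([0, 0, 0], [2.0], [[0.5]])      # mu^3 + 3 mu sigma^2
print(second, fourth, third)  # prints 2.5 3.0 11.0
```

The cost grows factorially with the number of factors, which is acceptable here because the products that arise in the Volterra terms have at most K factors.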

We  are  now  ready  to  state  and  prove  the  main  theorem  of  this  paper. 
Theorem 6:

Consider the signal process $\{x(t), \xi(t)\}_{t \ge 0}$ evolving as in (2.3)
and (2.1) and the observation process $\{z(t)\}_{t \ge 0}$ as in (2.2). Assume
that (2.3) satisfies Assumption 2. Then the least squares filtered
estimate $\hat{x}(t|t)$ can be obtained via the following finite dimensional
system of bilinear stochastic differential equations with suitable initial
conditions:

$$dx^*(t|t) = \Big(A^*(t)\,dt + \sum_{i=1}^{N} B_i^*\,\hat{\xi}_i(t|t)\,dt + \sum_{i=1}^{N} C_i^*\,d\mu_i(t)\Big)x^*(t|t)$$

$$\hat{x}(t|t) = L\,x^*(t|t), \quad t \ge 0 \tag{3.8}$$

where $\hat{\xi}(t|t)$, $t \ge 0$ is given by the standard Kalman-Bucy filter [9],
$A^*(\cdot)$ is a deterministic matrix time function, L, $\{B_i^*\}_{i=1}^{N}$, $\{C_i^*\}_{i=1}^{N}$ are
constant matrices, and

$$\mu(t) \triangleq [\mu_1(t), \mu_2(t), \dots, \mu_N(t)]^T \triangleq \int_0^t H^T(s)\,R^{-1}(s)\,d\nu(s)$$

is the modified innovations process.

Furthermore, the associative algebra generated by the following
set of matrices,

$$\Big\{\mathrm{Ad}^{k}_{\mathrm{Ad}^{l}_{C_i^*}(A^*(t))}(B_j^*)\ \Big|\ i,j = 1,2,\dots,N;\ k,l = 0,1,2,\dots\Big\}_{t \ge 0},$$

is nilpotent of order K.
Proof:

From Lemma 3, it is clear that we may take - without loss of
generality - (2.3) to be a stationary bilinear system in which the matrices
A and $\{B_i\}_{i=1}^{N}$ are of the special form stipulated in (3.2) and (3.3)
respectively. Furthermore, we shall assume for ease of exposition that
$A = \mathrm{diag}(a_1, a_2, \dots, a_M)$; consequently $B_k \triangleq [b_k^{i,j}]$ is such that

$$b_k^{i,j} = \begin{cases} 0, & j \ne i+1 \\ b_k^i, & j = i+1 \end{cases}, \quad k = 1,\dots,N,$$

and $x_i(0) = 0$ a.s. for $i < M$. For, as will become apparent in the sequel,
the proof for the general case would involve little more than additional
notational complexity.

We now apply Lemma 4 to (2.3) with the above mentioned specialization.
With the help of Lemma 5, the resulting differential equation analogous
to (3.6) can be written in the following form:


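$$d\hat{x}(t|t) = \Big(A + \sum_{j=1}^{N} B_j\,\hat{\xi}_j(t|t)\Big)\hat{x}(t|t)\,dt + \sum_{j=1}^{N} B_j\,\hat{x}^j(t|t)\,dt + \big[\hat{x}^1(t|t),\ \hat{x}^2(t|t),\ \dots,\ \hat{x}^N(t|t)\big]\,H^T(t)R^{-1}(t)\,d\nu(t)$$

where the "augmenting states" appearing in the above equation are defined as

$$x^j(\cdot) \triangleq [x_1^j(\cdot),\ x_2^j(\cdot),\ \dots,\ x_M^j(\cdot)]^T, \quad j = 1,2,\dots,N, \tag{3.9}$$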


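$$x_q^k(\sigma_0) = \begin{cases} \displaystyle\sum_{m_1,\dots,m_{M-q}=1}^{N}\ \sum_{r=1}^{M-q}\ \int_0^{\sigma_0}\!\!\int_0^{\sigma_1}\!\!\cdots\!\int_0^{\sigma_{M-q-1}} P_{k,m_r}(\sigma_0,\sigma_r)\,e^{a_{q+r-1}(\sigma_{r-1}-\sigma_r)}\,b_{m_r}^{q+r-1}\,e^{a_M\sigma_{M-q}}\,x_M(0)\,\Big(\prod_{\substack{i=1 \\ i \ne r}}^{M-q} e^{a_{q+i-1}(\sigma_{i-1}-\sigma_i)}\,b_{m_i}^{q+i-1}\,\xi_{m_i}(\sigma_i)\Big)\,d\sigma_{M-q}\cdots d\sigma_1, & q < M \\[2ex] 0, & q = M \end{cases} \tag{3.10}$$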

(Note: In the general case $x_q^k(\sigma_0)$ would be a vector of a dimension
consistent with the q-th Jordan block.)

and $P_{i,j}(t,\sigma)$, $i,j = 1,\dots,N$; $0 \le \sigma \le t$,
is the i,j-th element of the conditional covariance matrix

$$P(t,\sigma) = E\big[(\xi(t) - \hat{\xi}(t|t))(\xi(\sigma) - \hat{\xi}(\sigma|t))^T \mid z^t\big] \tag{3.11}$$

Also recall [10] that $P(t,\sigma)$, $\sigma \le t$ satisfies the following differential
equation

$$\frac{\partial P(t,\sigma)}{\partial t} = \big[F(t) - P(t)H^T(t)R^{-1}(t)H(t)\big]\,P(t,\sigma) . \tag{3.12}$$


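Now, if we differentiate the augmenting state vector

$${}^1x(\cdot) \triangleq [x^{1T},\ x^{2T},\ \dots,\ x^{NT}]^T \in \mathbb{R}^{MN}, \tag{3.13}$$

then utilizing (3.12), we find that

$$d\,{}^1x(t) = \alpha^1(t)\,{}^1x(t)\,dt + \beta^1(t)\,x(t)\,dt + \sum_{i=1}^{N} \gamma_i^1\,{}^1x(t)\,\xi_i(t)\,dt \tag{3.14}$$

where $\alpha^1(\cdot)$, $\beta^1(\cdot)$, $\{\gamma_i^1\}_{i=1}^{N}$ are determined by the problem parameters
and the error covariance $P(\cdot)$, the (M x M) blocks of $\alpha^1(\cdot)$ and $\beta^1(\cdot)$ are
upper triangular and $\{\gamma_i^1\}_{i=1}^{N}$ are block diagonal matrices with identical
strictly upper triangular blocks given by

$$\text{a block of } \gamma_i^1 = \begin{bmatrix} 0 & b_i^1 & 0 & \cdots & 0 & 0 \\ 0 & 0 & b_i^2 & \cdots & 0 & 0 \\ \vdots & & \ddots & \ddots & & \vdots \\ 0 & 0 & \cdots & 0 & b_i^{M-2} & 0 \\ 0 & 0 & \cdots & 0 & 0 & 0 \\ 0 & 0 & \cdots & 0 & 0 & 0 \end{bmatrix} \tag{3.15}$$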

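Now we may again apply Lemmas 4 and 5 to (3.14), and get

$$d\,{}^1\hat{x}(t|t) = \Big(\alpha^1(t) + \sum_{i=1}^{N} \gamma_i^1\,\hat{\xi}_i(t|t)\Big)\,{}^1\hat{x}(t|t)\,dt + \beta^1(t)\,\hat{x}(t|t)\,dt + \sum_{j=1}^{N} \gamma_j^1\,{}^1\hat{x}^j(t|t)\,dt + \big[{}^1\hat{x}^1(t|t),\ {}^1\hat{x}^2(t|t),\ \dots,\ {}^1\hat{x}^N(t|t)\big]\,H^T(t)R^{-1}(t)\,d\nu(t)$$

where the "new" set of augmenting states are

$${}^1x^j(\cdot) \triangleq [x^{j,1\,T}(\cdot),\ x^{j,2\,T}(\cdot),\ \dots,\ x^{j,N\,T}(\cdot)]^T, \quad j = 1,\dots,N$$

$$x^{j,k}(\cdot) \triangleq [x_1^{j,k}(\cdot),\ x_2^{j,k}(\cdot),\ \dots,\ x_M^{j,k}(\cdot)]^T \tag{3.16}$$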


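$$x_q^{j,k}(\sigma_0) = \sum_{m_1,\dots,m_{M-q}=1}^{N}\ \sum_{\substack{r_1,r_2=1 \\ r_1 \ne r_2}}^{M-q}\ \int_0^{\sigma_0}\!\!\int_0^{\sigma_1}\!\!\cdots\!\int_0^{\sigma_{M-q-1}} P_{j,m_{r_1}}(\sigma_0,\sigma_{r_1})\,P_{k,m_{r_2}}(\sigma_0,\sigma_{r_2})\,e^{a_{q+r_1-1}(\sigma_{r_1-1}-\sigma_{r_1})}\,b_{m_{r_1}}^{q+r_1-1}\,e^{a_{q+r_2-1}(\sigma_{r_2-1}-\sigma_{r_2})}\,b_{m_{r_2}}^{q+r_2-1}\,e^{a_M\sigma_{M-q}}\,x_M(0)\,\Big(\prod_{\substack{i=1 \\ i \ne r_1,r_2}}^{M-q} e^{a_{q+i-1}(\sigma_{i-1}-\sigma_i)}\,b_{m_i}^{q+i-1}\,\xi_{m_i}(\sigma_i)\Big)\,d\sigma_{M-q}\cdots d\sigma_1, \quad q < M-1 \tag{3.17}$$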


Now the new augmenting state vector

$${}^2x(\cdot) \triangleq [{}^1x^{1T}(\cdot),\ {}^1x^{2T}(\cdot),\ \dots,\ {}^1x^{NT}(\cdot)]^T \in \mathbb{R}^{MN^2} \tag{3.18}$$


analogous to (3.16), is seen to satisfy the differential equation

$$d\,{}^2x(t) = \alpha^2(t)\,{}^2x(t)\,dt + \beta^2(t)\,{}^1x(t)\,dt + \sum_{i=1}^{N} \gamma_i^2\,{}^2x(t)\,\xi_i(t)\,dt \tag{3.19}$$

in which the M x M blocks of $\alpha^2(\cdot)$, $\beta^2(\cdot)$ and $\gamma_i^2$ have the same properties as
those of $\alpha^1(\cdot)$, $\beta^1(\cdot)$ and $\gamma_i^1$ respectively, with a block of $\gamma_i^2$ obtained
by setting $b_i^{M-2} = 0$ in the block (3.15) of $\gamma_i^1$. The above process can
now be iterated and it is clear that this sequence will terminate at the
M-th application - in the general case, this would be the total number of
Jordan blocks - of Kushner equations since $\gamma_i^{M-1} = 0$, $i = 1,2,\dots,N$.
Collecting together the differential equations for $\hat{x}(\cdot)$, ${}^1\hat{x}(\cdot)$, ...,
${}^{M-1}\hat{x}(\cdot)$, we see that the dimension of the resulting nonlinear filter equals

$$M + MN + MN^2 + \dots + MN^{M-1} = \frac{M}{N-1}\,(N^M - 1).$$

The filter form (3.8) follows upon

writing the innovations term in the standard bilinear format, which also
reveals the fact that the matrices $\{C_i^*\}_{i=1}^{N}$ consist of upper triangular
M x M blocks. We thus see that $\{B_i^*\}_{i=1}^{N}$ consists of (strictly) upper
triangular (M x M) blocks, while $A^*(t)$, $t \ge 0$ and $\{C_i^*\}_{i=1}^{N}$ consist
of (M x M) blocks that are upper triangular. It is, therefore, easy to
see - if we carry out matrix multiplications blockwise - that the set of
matrices

$$\big\{\mathrm{Ad}^{l}_{C_i^*}(A^*(t))\ \big|\ t \ge 0;\ i = 1,\dots,N;\ l = 0,1,\dots\big\}$$

also have (M x M) blocks all upper triangular. Consequently the set

$$\Big\{\mathrm{Ad}^{k}_{\mathrm{Ad}^{l}_{C_i^*}(A^*(t))}(B_j^*)\ \Big|\ t \ge 0;\ i,j = 1,\dots,N;\ k,l = 0,1,2,\dots\Big\}$$

has (M x M) blocks that are all strictly upper triangular. Hence,
blockwise multiplication shows that the associative algebra generated by
them is nilpotent of order K.

The filter structure revealed in the above theorem has some
interesting features worth noting. We first see that, analogous to the case
of linear signal models, the "drift" terms in (3.8) do preserve the bilinear
structure of the signal model (2.3) and also inherit its nilpotency
property; nevertheless the matrix $A^*(\cdot)$ is time varying although the
matrix A is constant. Secondly, the filter is bilinear in the innovations
as well. Also observe that the state spaces of both the signal model and
the filter are (not necessarily identical) nilpotent group manifolds.

This suggests that it may be worthwhile to investigate optimal filters
under criteria that are defined on such manifolds. Such an approach for
abelian groups was followed in [1].

4.  COMPUTATIONAL  CONSIDERATIONS

Realization of the filter (3.8) in the form of a block schematic
is shown in figure 1 on the following page. The practical importance of
the bilinear property of the above filter is that a real-time analog
implementation of the filter is still possible with easily available and
cheap hardware consisting of integrators, summers and multipliers.


Figure  1 

However, a major drawback of the filter is its "curse of dimensionality".
The following example illustrates the problem and shows one way of its
mitigation.

Example: We now apply the algorithm developed in the theorem
to construct the optimal filter for the case of system (2.3) with M = 3,
N = 2 and

$$A = \begin{bmatrix} a_1 & 0 & 0 \\ 0 & a_2 & 0 \\ 0 & 0 & a_3 \end{bmatrix}, \qquad B_i = \begin{bmatrix} 0 & b_1^i & 0 \\ 0 & 0 & b_2^i \\ 0 & 0 & 0 \end{bmatrix}, \quad i = 1,2 .$$


The filter is found to have a dimension of 21, having required 3
applications of Kushner's equations. We enumerate below the nonzero
3x3 blocks of $A^*(t)$, $\{B_k^*\}_{k=1}^{2}$, $\{C_k^*\}_{k=1}^{2}$ and L. The notation used
is as follows:

$T^{i,j}$: i,j-th block of matrix T

$P_{i,j}(t)$, $f_{i,j}(t)$: i,j-th elements of matrices $P(t)$ and
$[F(t) - P(t)H^T(t)R^{-1}(t)H(t)]$ respectively.

$\delta(\cdot)$: Kronecker delta function.

and finally, for any n x n matrix T and $0 \le i \le n$,


T : Matrix  T with  last  i rows  and  columns  replaced  by  zeros. 

*1  1 Ar 1 i 1 

A L’L(t)  = A;  A 1,J(t)  = B ... 

J " A » J 9 J 9 

2 

i . j=2,3; 


2 

’ ( t) 

= 2 >: 

i = l 

*i,2(t) 

2 

= Z P 
k=l 

= [A2 

- fA2 

•,V7  3 

■ * (O 

2 

= 2 L 

2,kvw  V 


..  S'’*, 


(t)I3l,  j-  4,5; 


l,kvw°k’ 


*2,  (j-3)  (t)T3^*  i=5,b; 
l,(j-5)(t)13' ’ i=5’6;  j 6,7; 




A*?’j(t)  = 2f A2  + f2,j(j-5)(t>13^  » j =  5  6'7 8 9 10 

<1,1  * v C’H'i=2'3;  v=1’2 

c*1,2  » 5(k-l)  I*  ; <!»3  = 6(k-  2)  • ; 

c*2,4  = &(k-i)*i3  ; ck2’6  = ^(k-2) ' I3  ; 


- 6(k - 1) • ; 


= b(k - 2)  l3  ; k - 1,2 


In the above matrices, note the repetition of certain block rows
and the presence of some all zero rows. This allows us in effect to
reduce the filter dimension from 21 to 10. This argument can be extended
to the general case, and it is not difficult to show that the
dimensionality can be reduced in this way from

$$\frac{M}{N-1}\,(N^M - 1) \quad \text{to} \quad \sum_{i=0}^{M-1} (M - i)\binom{N+i-1}{i} .$$
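
The two dimension counts are easy to tabulate. The snippet below is a small illustrative sketch (the function names are assumptions, not from the text); it evaluates the full count as the sum M + MN + ... + MN^(M-1), the reduced count as the binomial sum above, and reproduces the 21 and 10 quoted for the example with M = 3, N = 2.

```python
from math import comb

def full_filter_dimension(M, N):
    """M + M*N + ... + M*N**(M-1): one block of states per Kushner application."""
    return sum(M * N**i for i in range(M))

def reduced_filter_dimension(M, N):
    """Sum over i of (M - i) * C(N + i - 1, i), after removing repeated/zero rows."""
    return sum((M - i) * comb(N + i - 1, i) for i in range(M))

M, N = 3, 2
full_dim = full_filter_dimension(M, N)
reduced_dim = reduced_filter_dimension(M, N)
closed_form = M * (N**M - 1) // (N - 1)   # valid for N > 1
print(full_dim, reduced_dim, closed_form)  # prints 21 10 21
```

The full count grows geometrically in M for fixed N > 1, whereas the reduced count grows only polynomially in M, which is the sense in which the reduction mitigates the dimensionality problem.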